Handle Byte scalar type in quantized conv2d NHWC (#18427)#18427
Handle Byte scalar type in quantized conv2d NHWC (#18427)#18427khazaei wants to merge 2 commits intopytorch:mainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18427
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 New Failures, 2 Unrelated FailuresAs of commit 3757d54 with merge base 45a9717 ( NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following jobs failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This PR needs a
|
478d713 to
6940ed5
Compare
Summary: Extend the quantized conv2d NHWC operator to accept `ScalarType::Byte` (uint8) inputs in addition to `ScalarType::Char` (int8), enabling unsigned 8-bit quantized inference through the same HiFi code path. Differential Revision: D97819850
6940ed5 to
d77c9f0
Compare
Summary: Extend the quantized conv2d NHWC operator to accept `ScalarType::Byte` (uint8) inputs in addition to `ScalarType::Char` (int8), enabling unsigned 8-bit quantized inference through the same HiFi code path. Differential Revision: D97819850
d77c9f0 to
00878ec
Compare
Summary: Pull Request resolved: pytorch#18427 Extend the quantized conv2d NHWC operator to accept `ScalarType::Byte` (uint8) inputs in addition to `ScalarType::Char` (int8), enabling unsigned 8-bit quantized inference through the same HiFi code path. Differential Revision: D97819850
00878ec to
a9b575d
Compare
…ouped convolutions Summary: Rearrange weights from [OC, KH, KW] to [KH, KW, OC] for depthwise conv2d when `groups == input_channels` . The NNLib depthwise function (`xa_nn_conv2d_depthwise_per_chan_sym8sxasym8s`) expects weights in [KH, KW, OC] layout, but when the weight tensor is 4D [OC, KH, KW, IC/G], it arrives in [OC, KH, KW] order. A temporary buffer is allocated via `allocate_temp_memory` to hold the rearranged weights, which are then passed to the depthwise kernel instead of the original `p_kernel`. Also updates `test_custom_ops.py` to target the `Artemis_HiFi4_UT_v3` backend. Differential Revision: D97833420
Summary: Extend the quantized conv2d NHWC operator to accept `ScalarType::Byte` (uint8) inputs in addition to `ScalarType::Char` (int8), enabling unsigned 8-bit quantized inference through the same HiFi code path. Differential Revision: D97819850
a9b575d to
157ad59
Compare
Summary: Pull Request resolved: pytorch#18427 Extend the quantized conv2d NHWC operator to accept `ScalarType::Byte` (uint8) inputs in addition to `ScalarType::Char` (int8), enabling unsigned 8-bit quantized inference through the same HiFi code path. Differential Revision: D97819850
157ad59 to
3757d54
Compare
Summary:
Extend the quantized conv2d NHWC operator to accept
ScalarType::Byte(uint8) inputs in addition toScalarType::Char(int8), enabling unsigned 8-bit quantized inference through the same HiFi code path.Differential Revision: D97819850